Discovering patterns of correlation and similarities in software project data with the Circos visualization tool

Authors

  • Makrina Viola Kosti
  • Sofia Lazaridou
  • Nikoleta Bourazani
  • Lefteris Angelis
Abstract

Software cost estimation based on multivariate data from completed projects requires building efficient models. These models essentially describe relations in the data, based either on correlations between variables or on similarities between projects. The continuous growth in the amount of data gathered, together with the need for preliminary analysis to discover patterns that can drive the building of reasonable models, leads researchers towards intelligent and time-saving tools that can effectively describe data and their relationships. The goal of this paper is to suggest an innovative visualization tool, widely used in bioinformatics, which represents relations in data in an aesthetic and intelligent way. To illustrate the capabilities of the tool, we use a well-known dataset from software engineering projects.

Similar resources

Genome analysis J-Circos: an interactive Circos plotter

Summary: Circos plots are graphical outputs that display three-dimensional chromosomal interactions and fusion transcripts. However, the Circos plot tool is not an interactive visualization tool, but rather a figure generator. For example, it does not enable data to be added dynamically, nor does it provide information for specific data points interactively. Recently, an R-based Circos tool (RCi...

Circos: an information aesthetic for comparative genomics.

We created a visualization tool called Circos to facilitate the identification and analysis of similarities and differences arising from comparisons of genomes. Our tool is effective in displaying variation in genome structure and, generally, any other kind of positional relationships between genomic intervals. Such data are routinely produced by sequence alignments, hybridization arrays, genom...

A THEORETICALLY CORRECT RESOURCE USAGE VISUALIZATION FOR THE RESOURCE-CONSTRAINED PROJECT SCHEDULING PROBLEM

The cumulative resource constraints of the resource-constrained project scheduling problem (RCPSP) do not treat the resource demands as geometric rectangles; that is, activities are not necessarily assigned to the same resource units over their processing times. In spite of this fact, most papers on resource-constrained project scheduling, mainly in the motivation phase, use a strip packing of re...

Discovering unexpected information using a building energy visualization tool

In this paper we present a 3D visualization tool developed to gain insight into building energy consumption. We focus on the use of this software to extract information from a raw dataset, and more specifically on how unexpected features can be discovered. We present some related work on data visualization, our visualization tool and finally, two case studies of data ana...

Data Signatures for Validation and Evaluation of Temporal Associations

Discovering relations among domain variables from data plays an important role in automating and/or assisting the process of constructing domain models and validating existing ones. An important kind of relation is the temporal association between domain variables. While a straightforward application of correlation analysis may be insufficient for uncovering these relationships, we propose an app...

Journal:
  • CoRR

Volume abs/1110.1303  Issue 

Pages  -

Publication date 2011